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Abstract 

In 1956 Ryser gave a necessary and sufficient condition for a partial latin rectangle 

^^ ■ to be completable to a latin square. In 1990 Hilton and Johnson showed that Ryser's 

r \ \ condition could be reformulated in terms of Hall's Condition for partial latin squares. 

Thus Ryser's Theorem can be interpreted as saying that any partial latin rectangle R can 

be completed if and only if R satisfies Hall's Condition for partial latin squares. 

C^ i We define Hall's Condition for partial Sudoku squares and show that Hall's Condition 

for partial Sudoku squares gives a criterion for the completion of partial Sudoku rectangles 

that is both necessary and sufficient. In the particular case where n = pq, p\r, g|s, the 

result is especially simple, as we show that any r x s partial (p, 5)-Sudoku rectangle can 

^ ■ be completed (no further condition being necessary). 

m 

^ ■ 1 Introduction 

(N 

C^ ' A latin square of order n is an n x n array with entries from {1, 2, . . . , n} in which each symbol 

occurs exactly once in each row and column. An r x s latin rectangle of order n is an r x s 
array in which each cell is filled from the set {1, 2, . . . , n} of symbols and no symbol occurs 
more than once in any row or column. In 1956 Ryser [28] proved the following well-known 

. , - theorem. 

X 

H \ Theorem 1. Let Ri be an r x s latin rectangle on the symbols {1, 2, . . . , n}. Then Ri can be 

completed to form an n x n latin square on the symbols {1, 2, . . . , n} if and only if 

N(i) > r + s — n 

for all 1 < i < n, where N{i) is the number of times the symbol i occurs in Ri. 
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Figure 1: A (p, g)-Sudoku square. 



If n = pq, a (p, q)-Sudoku square is a latin square of order n which is partitioned into px q 
rectangles, each containing a set 

{(xp + i,yq+j) : I < i < p, I < j < q} 

of cells for some x and y, where 0<x<q — l,0<y<p — 1, and which have the property 
that each of these p x q rectangles contains each of the symbols 1,2, ... ,n exactly once. (See 
Figured!) 

The puzzle game Sudoku was invented by Howard Garns in 1979 [3]. The puzzle consists 
of a partial Sudoku square, in which a few of the cells are preassigned, and the objective 
is to complete the partial Sudoku square to a Sudoku square. Usually the puzzles found in 
newspapers and magazines use (3, 3)-Sudoku squares, although occasionally other types are 
used, for example (2, 3)- or (3, 4)-Sudoku squares. 

An r X s {p, g')-Sudoku rectangle on n = pq symbols is an r x s latin rectangle which is 
partitioned into px q rectangles plus, if p\ r and q\ s, along the right hand border, rectangles 
of size p X {s — s*) and along the lower border, rectangles of size {r — r*) x q, and finally 
a rectangle of size (r — r*) x (s — s*), where r* = \r/p\p and s* = [s/gjg, as illustrated in 
Figure [21 

Our first analogue for Sudoku squares of Ryser's Theorem is the following simple result. 



Theorem 2. Let p\r, q\s and n - 

to an n X n {p,q) -Sudoku square. 



pq. Any r x s {p,q)-Sudoku rectangle Rg can be extended 



Thus there is no let or hindrance to the extension; in this respect Theorem [2] is simpler 
than Ryser's Theorem. 

The second analogue of Ryser's Theorem is necessarily a bit more complicated than Ryser's 
Theorem itself. In order to describe it we first define some associated bipartite graphs Sa for 
a = 1, 2, ... , {r*/p) + 1 and S^ for /3 = 1, 2, . . . , {s*/q) + 1. Here S stands for "side" (of the 
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Figure 2: An r x s {p, g)-Sudoku rectangle. 

rectangle), and B stands for "bottom" (of the rectangle). These graphs are only defined if 
p\ r or q\ s respectively. 

Firstly, if p | r, graphs S** for a = 1, . . . , r* /p are defined with vertices 

'^(a—l)p+l 5 • • • ) "^ap 

(corresponding to the rows P(a-i)p+i, ■ ■ ■ , Pap) and vertices wi, . . . ,Wn (corresponding to the 
symbols 1, . . . ,n). The edge t'(a-i)p+jU'j is placed in S"* if symbol j does not occur in row 
P{a-i)p+i of Rs- Then, for each a, 1 < a < r*/p, from 5* we construct Sa by replicating 
vertex Vi q — {s — s*) times, the replications being denoted by Vij {I < j < q — {s — s*)), and 
by joining each replicated vertex of Vi to each of the vertices wi, . . . ,Wn that was joined to Vi 
in S"*. 

For Q = {r* /p) + 1, S"* is defined similarly. The vertices are 



^(a-l)p+l — Vr*+1, 



,Vr 



(corresponding to the rows Pr*+i, ■ ■ ■ , pr) and vertices wi, . . . , Wn- An edge Vr*+iWj is placed 
in S"* if symbol j does not occur in row Pr*+i- From 5* we construct Sa by replicating vertex 
Vi q — {s — s*) times, the replicated vertices being denoted by Vij {1 < j < q — {s — s*)), and 

by joining each replicated vertex to each of the vertices wi,. . . ,Wn that was joined to Vi in 

c* 

Ja- 

li q \ s, the bipartite graphs i?^ (1 < /3 < (s* /q) + 1) are defined in a similar way, but 
with columns and rows being interchanged, and with p, r, r* interchanged with q, s, s* . 
We are now able to state the second, more complicated analogue of Ryser's Theorem. 

Theorem 3. Let n = pq. An r x s {p,q)-Sudoku rectangle Rg can be extended to an n x n 
{p, q)-Sudoku square if and only if, when r \p, for 1 < a < \r/q] , the graph Sa has a matching 



from the replicated row vertices into the symbol vertices, and, when s \ q, for 1 < /3 < \s/q] , 
the graph Bp has a matching from the replicated column vertices into the symbol vertices. 

We have not been able to express these two conditions in the very simple way that Ryser's 
Theorem is expressed. Of course the existence of a matching can be expressed in terms of 
the satisfaction of a collection of inequalities — as Hall showed [TB] in 1935. (In this paper we 
reserve the terms "Hall's Condition" and "Hall Inequalities" for three more general analogous 
situations, one for partially coloured graphs, one for partial latin squares, and the third for 
partial Sudoku squares.) 

We now turn to the definitions of Hall's Condition for graphs. Hall's Condition for partial 
latin squares, and Hall's Condition for partial Sudoku squares. 

Let G be a finite simple graph. Let C be a set of colours, and let L : V{G) — t- 2 , the 
family of subsets of C, be a list assignment to G, so that each vertex is assigned a "list" or 
finite set of colours from C. A function / : V{G) — )• C is a vertex colouring of G provided 
that adjacent vertices have different colours. An L-colouring of V{G) is a vertex colouring 
/ : V{G) —?■ C such that f{v) € L{v) for each v € y{G). An L-colouring is often called a 
list- colouring. An independent set I of vertices of G is a set of vertices which does not contain 
any neighbouring pair of vertices. We let a{L, a, G) be the maximum number of vertices in 
an independent set of vertices of G, each of which contains a in its list. A necessary condition 
for there to exist an L-colouring of G is that 

Y,c^iL,a,H)>\ViH)\ (*) 

aeC 

for each subgraph H oi G. For a given subgraph H of G, inequality ([*]) is called Hall's 
Inequality for H, and the condition that "for each subgraph H of G, the Hall Inequality for 
H is satisfied" is called Hall's Condition for G. 

In the special case where G is a complete graph. Hall's Condition is sufficient to ensure 
that H has an L-colouring. (This is Hall's Theorem |16j . and an L-colouring in this case 
is a system of distinct representatives.) Hall's Condition for graphs has been the subject of 
several recent papers (see [71[8l[9l[ini[ni[ia[2Ql[2ll[22l[2ll[23]). In general, Hah's Condition 
for graphs is not sufficient to ensure there is a list-colouring. 

A latin square can be thought of as a proper vertex colouring of the cartesian product 
graph KnOKn with xi^n^Kn) = n colours. This graph can be visualised as an n x n grid 
in which the n vertices in each row are joined to each other, as are the n vertices in each 
column. However instead of giving a discussion in terms of graphs (which is easy enough to 
do) we shall give a more direct discussion using the more usual terminology for latin squares. 

In a partial nx n latin square Pi, some cells are filled from a set C of n symbols and some 
are empty. We can define a list Li{v) for each cell v as follows: 



Li{v) 



'the symbol in cell v, if cell v is filled; 

the set of symbols which are not used in the cells in 
the same row as v, nor in the cells in the same column 
as t;, if cell v is empty. 



A set of cells of Pi is called independent if no two cells occur in the same row or column. 
For a subset Q of cells of P;, we let a{Li,a, Q) be the maximum number of independent cells 
of Q all of which contain a in their lists. Hall's Inequality for a set Q of cells of P/ is that 

^a{Li,a,Q)>\Q\, 
o-gc 

where C is the set of n symbols available to be used in Pi. Hall's Condition is the collection 

2 

of all 2" inequalities obtained by letting Q range over all subsets of Pi . 

Hilton and Johnson in 1990 in [19j showed that Ryser's Theorem could be restated in the 
following way. 

Theorem [Ij. Let P; be an r x s latin rectangle on the symbols {1,2, . . . ,n}. Let Pi be a 
partial n x n latin square with Ri in the top left-hand corner, and the remaining cells empty. 
Then Pi can be completed to an n x n latin square if and only if Pi satisfies Hall's Condition 
for partial latin squares. 

In this case we do not need the whole set of inequalities comprising Hall's Condition. In 
fact if the Hall Inequality for the whole n x n partial latin square Pi is satisfied then the 
partial latin square can be completed. Thus the n inequalities of Ryser's Theorem can be 
replaced by just one inequality. 

Before turning to Hall's Condition for Sudoku squares it is convenient to define Gerechte 
Designs and Hall's Condition for them. Let {Pi,P2, . . . ,Pn) be a partition of the set of cells 
of an n X n latin square in which \Pi\ = n for each 1 < i < n. An n x n latin square is a 
(Pi,P2, . . . ,Pn)-Gerechte Design if each Pi contains each symbol exactly once. In the case 
when n = pq and each Pi is a p x q rectangle, then the Gerechte Design is a {p, (7)-Sudoku 
square. 

From a graph theory point of view, a (Pi, P2, . . . , Pn)-Gerechte Design can be thought of 
as a proper vertex colouring with n colours of the graph obtained from the cartesian product 
graph KnOKn (the vertices here correspond to the cells, and the vertex sets of the first and 
second K^s correspond to the rows and columns respectively) by placing an edge between 
two vertices if the corresponding cells are in the same part P^. 

We define Hall's Condition for partial (Pi, P2, . . . , Pn)-Gerechte Designs as follows. If Pd 
is a partial (Pi,P2, . . . ,P„)-Gerechte Design, for each cell r we define a list in the following 
way. 

{the symbol in the cell v of Po, if cell v is filled; 
the set of symbols which are not used in the cells in 
the same row as v, nor in same column as v, nor in 
the same part Pj of the partition containing v. 
A set of cells of Pd is said to be independent if no two cells occur in the same row, or 
the same column, or the same part. For a subset Q of the cells we let a{L£),a,Q) be the 
maximum number of independent cells of Q all of which contain the symbol a in their lists. 
Hall's Inequality for Q is 

^a(Lz),o-,Q) > IQl, 
o-gc 



and if Hall's Inequality is satisfied for each subset Q of Pq, then Pp is said to satisfy 
Hall's Condition for partial {Pi, P2, . . . , Pn)-Gerechte Designs, or, briefly, Hall's Condition 
for Cerechte Designs. 

Now we turn to discuss Hall's Condition for partial Sudoku squares. First let us note that 
we shall describe the px q subrectangles forming the {p, g)-Sudoku square as the big (Sudoku) 
cells, and the 1x1 subrectangles as the small (Sudoku) cells. A {p, g)-Sudoku square on a set 
C of n = pq symbols can be thought of as a proper vertex colouring of a graph obtained from 
the cartesian product graph KnOKn, in which, visualizing KnOKn as a grid, as described 
earlier, the p x q subrectangles (i.e. the big cells) correspond to complete bipartite subgraphs 
Kp^q. If the first K^ has vertices 

Vl,. . . ,Vp,Vp+i,. . . ,V2p, . . . ,V(^q_i)p+i,. . . ,Vqp, 

and the second Kn has vertices 

Wl,...,Wq, Wq+l ,...,W2q,..., W(p_l)q+l , . . . , Vpg, 

then the big Sudoku cells correspond to the Kp^q on the vertices 

'^{x-l)p+l ; • • • 5 "^xp and W(^y_i)qj^i , . . . , Wyq 

{x = 1,2, . . . ,q; y = 1,2, . . . ,p). In this case, for each small cell v, we define 

'the symbol in the cell v, if cell v is filled; 

the set of symbols which are not used in the small cells 
in the same row as v, nor in small cells in the same 
column as v, nor in the big pxq Sudoku cell containing 

A set of cells of Ps is said to be independent if no two cells occur in the same row, or 
the same column, or the same big cell. For a subset Q of the cells we let a{Ls,o',Q) be the 
maximum number of independent cells of Q all of which contain the symbol a in their lists. 
Hall 's Inequality for Q is 

Y,aiLs,c7,Q)>\Q\, 

aec 

and if Hall's Inequality is satisfied for each subset Q of Ps, then Ps is said to satisfy Hall's 
Condition for partial Sudoku squares. 

We shall show that Theorem [3] (together with Theorem [2]) can be reformulated as follows. 

Theorem [Sj. Let n = pq. Let Rs be an r x s {p, (7)-Sudoku rectangle on the symbol set 
C = {1,2,... ,n}. Let Ps be a partial (p, q')-Sudoku square with Rs in the top left-hand 
corner, and the remaining cells empty. Then Ps can be completed to an n x n (p, (7)-Sudoku 
square if and only if Ps satisfies Hall's Condition for partial Sudoku squares. 
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Figure 3: Big and medium cells, big and small rows. 

2 Partial Sudoku Squares: the simple case 

In this section we prove Theorem [2j First we need some preliminaries. 
A k- edge- colouring of a finite multigraph G is a map 

(^:E{G)^{l,2,...,k}, 

where E{G) is the edge set of G. For any vertex v € V{G), the vertex set of G, and any 
i E {1, . . . , k}, let Ei{v) be the set of edges incident with v of colour i. A /c-edge-colouring is 
called equitable if 

I \Ei{v)\ - \Ej{v)\ I < Ifor ahuG V{G) and i, j G {1, 2, . . . , A;}. 

The following lemma is easy to prove; it is due independently to McDiarmid [27j and de 
Werra |32l. 



Lemma 4. Let k > 1 be an integer, and let G be a bipartite multigraph. Then G has an 
equitable k- edge- colouring. 

Let L be a latin square of order n, and let S = {pi, . . . ,Ps), T = {qi, . . . ,qt) and U = 
(ri, . . . ,r„) be three compositions of n. (A composition of n is an ordered set of positive 
integers which sum to n.) An {S,T, U)- amalgamation of L is an s x t array constructed as 
follows. We place a symbol k in cell (i,i) every time that a symbol from 

{ri H h rfc_i + 1, . . . , n H h r^} 

occurs in one of the cells (a, (3) where 

a G {pi H Vpi-i + 1,... ,pi H Vpi} 



and 

(3 £ {qi-\ h qj-i + 1, . . . , gi H ^ Qj}. 

Let n > 1 be an integer and let S = {pi, . . . ,Ps), T = {qi, . . . ,qt) and U = {ri, . . . , r„) 
be three compositions of n. An (S,T,U)- outline latin square of order n is an s x t array 
containing n^ symbols from {1,. . . ,u} (each cell may contain more than one symbol), such 
that the following conditions hold: 

(i) row i contains symbol k pir^ times; 
(ii) column j contains symbol k qjVf^ times; 
(iii) cell {i,j) contains piqj symbols, counting repetitions. 

It is easy to check that the {S, T, [/)-amalgamation of a latin square of order n is an 
{S, T, C/)-outline latin square of order n. The following theorem, sometimes called the Amal- 
gamated Latin Square Theorem, shows that the converse is also true. It was proved by the 
second author in |17] in 1980. See pj and [2j for earlier work by Andersen and the second 
author on somewhat similar lines, and |18] for a clearer account of the Amalgamated Latin 
Square Theorem. 

Theorem 5. Let O be an {S,T,U)- outline latin square. Then there is a latin square L such 
that O is an [S,T,U)- amalgamation of L. 

We now turn to the proof of Theorem [2l and show that if n = pq, p\r and q\s, and if Rs 
is an r X s (p, (7)-Sudoku rectangle on the symbols 1, 2 . . . , n, then the partial n x n Sudoku 
square containing Rs in its top left-hand corner, the other cells being empty, can be completed 
to form an n X n {p, g)-Sudoku square. 

Recall that we call the p x q rectangles (some empty, some filled) of our partial (p, q)- 
Sudoku square Ps the big cells, and the 1x1 cells of Ps the small cells. The big cells are 
arranged in rows which we shall call big rows, and in columns which we shall call big columns. 
Each of the r/p big rows of Ps consists of s/q big cells which are filled, and (n — s)/q big cells 
which are empty. Similarly with the first s/q big columns. 

We start by placing the symbols 1, 2, . . . , n in each of the big cells which are empty. Then, 
for the first r/p big rows, working big row by big row, one after the other, we divide each of 
the big cells carefully into p sets of q small cells, all in the same small row. We shall call these 
sets medium cells. (See Figure El) 

Our first task will be to distribute the n symbols in the previously empty big cells in each of 
the first r/p big rows amongst the medium cells, so that each medium cell contains q symbols. 
The only constraint here is that each small row should contain each symbol exactly once. We 
do this one big row at a time. For this we construct a bipartite graph G as follows. One 
vertex set contains p vertices, say ri, . . . ,rp, representing the p small rows of some big row. 
The other vertex set contains n vertices wi, . . . ,Wn representing the symbols. The symbol 
vertex Wi is joined to the row vertex Vj by an edge if symbol i is not contained in row j of S. 
Each symbol vertex Wj has degree p — {s/q) = (n — s)/q and each row vertex Vj has degree 
n — s. We give G an equitable fc-edge-colouring with colours (s/q) + 1, . . . ,p. If symbol vertex 




medium cells to be filled 



Figure 4: One of the big cells in the {{s* /q) + l)-th big column. 

Wi is joined to row vertex Vj by an edge coloured k, then we place symbol i in row j in the 
fc-th big column. Since the edge-colouring is equitable, q symbols are each placed in the j-th 
medium cell in big column k. 

The analogous process is performed with the first s/q big columns, and in this case the 
medium cells are vertical and contain p symbols. 

At this point it is easy to check that we have an {S, T, C/)-outline latin square, where 

5 = (1,1,... ,l,p,p,...,p) 

T = {l,l,...,l,q,q,...,q) 

U={1,1,...,1) 

where S has r I's and {n — s)/q g's. By Theorem [H there is a latin square L of order n of 
which our outline latin square is the {S, T, C/)-amalgamation. Because each big cell was filled 
with the symbols 1,2,... , n, this latin square is in fact a Sudoku square. D 

3 Partial Sudoku squares: the general case. 

In this section we prove Theorem [3l Recall that, in this case, we shall assume that p \ r 
or q \ s (or both). Suppose q \ s. Then graphs Sa are defined for a = 1, 2, ... , {s* /q) + 1 
(= [s/g]). The tactic for the proof is to apply Theorem [5] in much the same way as in the 
proof of Theorem [21 except that we have also to consider the first [r/p] partly filled big cells 
in the {{s* /q) + l)-th big column, and, if p j" r, the first {s* /q) + 1 partly filled big cells in the 
{{r* /p) + l)-th big row. 

Proof of Theorem 0. We need to divide the unfilled part of the partly filled big cells into 
horizontally placed medium cells of size 1 x (g — (s — s*)) for the first r* /p big cells (see Figure 
H]), and, \ip\r, just the first r — r* horizontal medium cells in big cell {{r* /p) + 1, (s* /q) + 1). 
(See Figure \5\) We need to assign the symbols that have not been used in the filled part 
of these big cells between the horizontal medium cells in such as way that no symbol occurs 



9 



r 



Rs 



corner cells 



Figure 5: The partition of the corner cell. 



more than once in any small row. Once these medium cells are filled in the {{s* /q) + l)-th 
big column, then we can fill the empty horizontal medium cells in the first r* /p big rows and 
the first (r — r*) rows in the {{r* /p) + l)-th big row and in big columns {s* /q) + 2, . . . ,p by 
applying the argument in the proof of Theorem [2j This argument is not precisely the same 
because of the need to fill the horizontal medium cells in the small rows r* + 1, . . . , r, but this 
slight difference presents no difficulties. 

The matchings in the graphs Sa, for a = 1, 2, ... , {r* /p) + 1, are used to fill the horizontal 
medium cells in the {{s* /q) + l)-th big column. 

For 1 < a < r* /p^ if Sa has a matching Ma, and if vertex V(^a-i)p+x,y is joined to vertex Wj 
by an edge of M^, then the symbol j is placed in the x-th horizontal medium cell in the a-th 
big row and the {{s* /q) + l)-th big column. Exactly q — {s — s*) symbols are placed in this 
medium cell, and the symbols placed do not occur elsewhere in row (a — l)p + x. Moreover 
big cell (a, {s* /q) + 1) contains each symbol exactly once, \ip\r then the same technique is 
used to fill the x-th horizontal medium cell in row r* + x for 1 < x < r — r* and big column 
{s* /q) + 1, using the matching Mi^j.*/p-^j^i in the graph S(^r*/p)+i- Now we apply the argument 
of Theorem [2] to fill all the remaining medium horizontal cells in big rows 1, . . . , r* /p, and the 
remaining medium horizontal cells in the first r — r* small rows in big row (r* /p) + 1. 

Next if p\ r we apply the same argument using the matchings in the graphs Bjs (1 < /3 < 
s* /q) to fill in the vertical medium cells in big row {r* /p) + 1 and big columns 1,2,..., s* /q, 
and then to fill in the first s — s* vertical medium cells in big cell {{r* /p) + 1, (s* /q) + 1). 
(See Figure [71) Finally, using the argument of Theorem [2] again, we fill all the remaining 
vertical cells in big columns l,...,s*/g and the first s — s* vertical medium cells in big 
column {s* /q) + 1. 

In the big cell {{r* /p) + 1, {s* /q) + 1) we have (r — r*) x [s — s*) single small cells, all filled, 
s — s* filled vertical medium cells of size {p — {r — r*)) x 1, r — r* filled horizontal medium 
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Figure 6: Sa and the cells the matching is used to fill (symbol j goes in the i-th medium cell). 

cells of size 1 x (g — (s — s*)) and one empty cell of size (p — {r — r*)) x {q — {s — s*)). We 
fill all the empty big cells with the symbols 1,2, ... ,n, and all the incompletely filled big cells 
with the symbols from 1,2, ... ,n which do not already occur there. 

One may easily check that we now have an {S, T, C/)-outline latin square, where 

S = {l,l,...,l,p-{r- r*),p,p, ...,p) 

T ={l,l,...,l,q-{s- s*),q,q,...,q) 
U={1,1,...,1) 

where S has r I's and q — {r* + 1) p's and T has s I's and p — {s* + 1) g's. 

By Theorem [5l there is a latin square L of order n of which our outline latin square is 
the {S, T, C/)-amalgamation. Moreover, because the big cells all contain each of the symbols 
1,2, ... ,n exactly once, L is a Sudoku square which extends Rs. This proves the necessity. 

The sufficiency follows as the whole process can be reversed in a trivial way when Rs is 
extended to a Sudoku square L. D 



4 Partial Sudoku Squares and Hall's Condition 

In this section we prove Theorem [31 in which Theorem [3] is reformulated in terms of Hall's 
Condition for partial Sudoku squares. Let us remark that if ii^ is a subset of the set of cells 
of a partial Sudoku square, some of which are preassigned, then the Hall Inequality for H is 
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Figure 7: Cells of different sizes. 

satisfied if and only if the Hall Inequality for the subset of H consisting of its empty cells is 
satisfied. This is easy to see, and is proved in slightly different circumstances in, for example, 
[5], [21] or [23]. We shall just consider the most complicated case in Theorem [3l where p \ r 
and q \ s. The case when p \ r or q \ s is similar but easier; if p | r and q \ s there is nothing 
to prove. 

Proof of Theorem . By Theorem [Sj Rs can be completed if and only if Rs can be extended 
to an (r* +p) x [s* + q) (p, g)-Sudoku rectangle, say ijj. So we need only focus on whether or 
not we can fill the remaining small cells in the {{s* /q) + l)-th big column and {{r* /p) + l)-th 
big row that already contain some preassigned cells. (We shall call the partially filled big cells 
the restricted big cells.) 

We know from Theorem [3] that these restricted big cells can be divided into appropriate 
horizontal and vertical medium cells if and only if the graphs Sq, (1 < a < {r* /p) + 1) and 
Bp (!</?< (s* /q) + 1) have matchings into the symbol vertices. (Note that the big cell 
{{r* /p) + 1, (s* /q) + 1) has only r — r* horizontal medium cells and s — s* vertical medium 
cells, and the corresponding matchings are smaller.) 

By Hall's Theorem, a necessary and sufficient condition for the existence of such a matching 
Ma in Sa {I <a< {r*/p) + 1) is that 



U N{v) > \A\ 



for each subset A of the set of (replicated) row vertices of Sa (-^(w) is the set of neighbours 
of a vertex v). But this is essentially the same as Hall's Condition for the a-th restricted big 
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Figure 8: Two incomplete partial latin squares with n cells preassigned. 



cell in big column (s* /q) + 1, namely 



Y^ a{L,a,H)>\H\ 



cre{l,...,n} 

for each subset H of the restricted big cell. Here the q — {s — s*) replicates in Sa of a row vertex 
'^{a-i)p+j ill S^ correspond to the q— {s — s*) small cells in the j-th horizontal medium cell in 
the a-th big cell in big column {s* /q) + 1. A solution of just some of the row- vertex replicates 
corresponds to a subset H containing just some of the small cells in the j-th medium cell. A 
symbol a is available for inclusion as an end-vertex of an edge of a matching, or in a cell of 
H, if and only if in at least one of the rows of H there is no preassigned occurrence of the 
symbol. A symbol selected to be an end vertex of a matching may be selected to be placed 
in the corresponding row of H, and vice-versa. Thus in this case the two versions of Hall's 
Condition are essentially the same. 

The argument for the matchings in the bottom graphs -B/3 (1 < /3 < {s* /q) + 1) is, of 
course, the same. D 

5 Further comments 

5.1 Analogues for Sudoku squares of results about partial latin squares 

Hall's Condition for graphs has been the object of a certain amount of study (see e.g. [71 
[8l[9l[l0l[l3l[20l[22l[2ll[l3])and recently Hall's Condition for partial latin squares has been 
studied (see [191 1231 [23]). I^i every case that has been looked at so far, whenever there is a 
theorem saying that a certain kind of partial latin square can be completed if and only if such 
and such a condition is satisfied, then the condition (whatever it might be) can be replaced 
by Hall's Condition. However, there are examples of classes of partial latin square for which 
Hall's Condition is not a sufficient condition for completion (it is always necessary). 

It seems quite likely that there are analogues of these results for partial Sudoku squares. 
At present, we just have the example of Ryser's Theorem, given in this paper. It would also 
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Figure 9: An incompletable partial Sudoku square. 

be interesting to construct classes of partial Sudoku squares for which Hall's Condition is not 
a sufficient condition for there to exist a completion. 

It is worth remarking that there are really two natural kinds of partial Sudoku squares. In 
one kind the set of preassigned cells includes either none or all of the constituent small cells 
in any big cell (in the terminology of Section 3), and in the other this restriction is not made. 
Theorem [2] considers partial Sudoku squares of the former type and Theorem [3] considers 
partial Sudoku squares of the latter type. In the next subsection we will look at two Sudoku 
variants of the Evans Conjecture. 

5.2 Two Sudoku analogues of the Evans Conjecture 

In 1960 Trevor Evans [14J posed the now famous conjecture that bears his name: What is the 
least number of preassigned elements an n x n matrix can have and not be completable? The 
conjecture was n, and n would be the best possible because of the configurations in Figure El 
The Evans Conjecture was eventually proved independently by Smetaniuk [30] and Andersen 
and Hilton [3]. We can consider two variants of the Evans Conjecture for Sudoku squares. In 
the first, we ask: 

Problem 1. What is the least number of small cells that can be preassigned in a partial pxq 
Sudoku square, so that it is not completable? 

It seems reasonable to suppose that the answer is p + q — 1, and this bound can be 
attained (see Figure [9]). (This question has also been asked by Antal Ivanyi [26j.) Note 
that this includes the normal Evans Conjecture as a special case, as any latin square can be 
regarded as a (l,n)-Sudoku square. 

The second variant of the Evans Conjecture for Sudoku squares, is one where the big cells 
are either empty or completely filled. 

Problem 2. What is the least number of big cells that can be preassigned in a partial pxq 
Sudoku square, so that it is not completable? 

We conjecture that the least number is n = pq. This bound can be attained by the 
following construction. Let Ri,R2, . . . ,Rk be the rows of a k x k matrix M filled with the 
symbols 1,2,..., k'^, each symbol occurring once. Let Mi = M and, for 2 < i < k, let Mi be 
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Figure 10: An incompletable partial Sudoku square. 



the matrix 



Mi 



Ri+l 
Rn 

Ri 



\R^-l) 

Consider the partial k"^ x k"^ Sudoku square in Figure [TOl Then, looking at the first row and 
ik-ih. column, we see that there is no symbol that can be placed in cell {l,ik). 

5.3 Orthogonal latin squares and Hall's Condition 

Suppose that {A,B) is a pair of n x n orthogonal latin squares on the symbols 1,2, ... ,n. 
For 1 < i < n, let Pi be the set of cells of B that contain the symbol i. Then, obviously, 
(Pi, . . . , Pn) is a partition of the set of cells of an n x n matrix such that \Pi\ = n. Therefore A 
is a (Pi, . . . , P„)-Gerechte Design. In general, any latin square with an orthogonal mate can 
be thought as a kind of Gerechte Design. Given a latin square A with an associated partition 
(Pi, . . . ,Pn) corresponding to an orthogonal mate, we call A a (Pi, . . . , Pn)- orthogonal latin 
square design. 

The partition (Pi, . . . , P„) itself has the properties: 

(i) We can form a latin square B by placing symbol i in the cells of Pj (1 < i < n). 

(ii) There is a way of placing the symbols 1,2, ... ,n each exactly once in the cells of Pj 
(1 < i < n) so that we form another latin square A. 
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Then, of course, {A, B) is a pair of orthogonal latin squares. Any partition (Pi, . . . , P„) 
with properties (i) and (ii) is an OLS-partition of the nxn matrix. (OLS stands for "orthogonal 
latin square"). 

A partial (Pi, . . . , P„)-OLS design is a partial latin square which is a partial (Pi, . . . , P„)- 
Gerechte Design. Thus in a partial (Pi, . . . , P„)-OLS design, we are given the partition 
(Pi, . . . ,Pn), and we have a partial latin square in which each Pj contains each symbol at 
most once. 

The notion of Hall's Condition for a partial (Pi, . . . , Pn)-OLS design is just a particular 
special case of Hall's Condition for partial (Pi, . . . , P„)-Gerechte Design defined in Section 1. 
We shall refer to it briefly as Hall's Condition for OLS-designs. 

It would be fascinating if there were results analogous to Ryser's Theorem for OLS-designs. 
Our theorem about Sudoku squares suggests the following possibility: 

Problem 3. Let (Pi, . . . , P„) be an OLS-partition of an n x n matrix of cells. Suppose that 
we have an n x n partial latin square P^ in which Pi, . . . , P^ are filled with each Pj (1 < i < r) 
containing each symbol exactly once. Is it true that we can complete the partial latin square 
in such a way that each of Pr+i, . . . , Pn also contain each symbol exactly once if and only if 
Pi9 satisfies Hall's Condition for partial (Pi, . . . ,Pn)-OLS designs? 

5.4 Complexity 

We do not know the complexity of the decision problem: given a partial latin square, decide 
if it satisfies Hall's Condition for latin squares. This problem is known to be in co-NP |24) . 
but it is not known to be in NP. We can ask the same question about the complexity of the 
corresponding problem for Sudoku squares: given a partial Sudoku square, decide if it satisfies 
Hall's Condition for Sudoku squares. 

Colbourn [6] showed that the following decision problem is NP-complete: given a partial 
latin square, decide if it can be completed. Yato and Seta [33j showed that the corresponding 
problem for Sudoku squares is also NP-complete: given a partial Sudoku square, decide if 
it can be completed. We do not know if the problem remains NP-complete if the big cells 
are required to be either filled or empty, although it seems reasonable to suppose that this is 
indeed the case. 

In [31] it is shown that deciding if a partial Gerechte Design is completable is NP-complete, 
even if the partial Gerechte Design contains no filled cells at all. 

Hilton and Vaughan [21] have looked at the decision problem: given a partial latin square 
satisfying Hall's Condition for latin squares, decide if it can be completed. They showed that 
this problem is NP-hard. We do not know if this is true of the corresponding problem for 
Sudoku squares: given a partial Sudoku square satisfying Hall's Condition for Sudoku squares, 
decide if it can be completed. 

A maximum matching in a bipartite graph on n vertices can be found in O(n^) time (see 
e.g. [SHI Chapter 20]), so there is a polynomial algorithm to decide if a bipartite graph has 
a perfect matching. To decide if an r x s partial Sudoku square, as considered in Theorem 
[3l can be completed, we need to decide if each of n bipartite graphs has a perfect matching. 
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Consequently, the problem of deciding if such a partial Sudoku squares can be completed can 
be performed in polynomial time. 
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